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RAW SEQUENCE LISTING DATE: 02/01/2002 

PATENT APPLICATION: US/10/006 , 163 TIME: 11:47:54 

Input Set : N:\Crf3\RULE60\10006163.raw 
Output Set: N:\CRF3\02012002\J006163.raw 

SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Lai, Preeti 

6 Corley, Neil C. 

8 (ii) TITLE OF INVENTION: HUMAN SHORT CHAIN DEHYDROGENASE 

10 (iii) NUMBER OF SEQUENCES: 3 

12 (iv) CORRESPONDENCE -ADDRESS: 

13 (A) ADDRESSEE: Incyte Pharmaceuticals, inc. 

14 (B) STREET: 3174 Porter Dr. 

15 (C) CITY: Palo Alto 

16 (D) STATE: CA 

17 (E) COUNTRY: USA 

18 (F) ZIP: 94304 

20 (V) COMPUTER READABLE FORM: 

21 (A) MEDIUM TYPE: Diskette 

22 (B) COMPUTER: IBM Compatible 

23 (C) OPERATING SYSTEM: DOS 

24 (D) SOFTWARE: FastSEQ for Windows Version 2.0 
26 (vi) CURRENT APPLICATION DATA: 

C--> 27 (A) APPLICATION NUMBER: US/10/006 , 163 

C--> 28 (B) FILING DATE: 04-Dec-2001 

29 (C) CLASSIFICATION: 

31 (vii) PRIOR APPLICATION DATA: 

32 (A) APPLICATION NUMBER: US/09/249,241 

33 (B) FILING DATE: 

36 (viii) ATTORNEY/AGENT INFORMATION: 

37 (A) NAME: Billings, Lucy J. 

38 (B) REGISTRATION NUMBER: 36,749 

39 (C) REFERENCE/DOCKET NUMBER: PF-0475 US 

41 (ix) TELECOMMUNICATION INFORMATION: 

42 (A) TELEPHONE: 650-855-0555 

43 (B) TELEFAX: 650-845-4166 

44 (C) TELEX: 
47 (2) INFORMATION FOR SEQ ID NO: 1: 

49 (i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 313 amino acids 

51 (B) TYPE: amino acid 

52 (C) STRANDEDNESS : single. 

53 (D) TOPOLOGY: linear 

55 (vii) IMMEDIATE SOURCE: 

56 (A) LIBRARY: PROSNOT01 

57 (B) CLONE: 356351 

59 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
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RAW SEQUENCE LISTING DATE: 02/01/2002 

PATENT APPLICATION: US/10/006 f 163 TIME: 11:47:54 

Input Set : N:\Crf3\RULE60\10006163.raw 
Output Set: N:\CRF3\02012002\J006163.raw 
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84 180 185 190 

85 Pro Gly He Val Gin Thr Glu Leu Leu Lys Glu His Met Ala Lys Glu 

86 195 200 205 

87 Glu Val Leu Gin Asp Pro Val Leu Lys Gin Phe Lys Ser Ala Phe Ser 

88 210 215 220 

89 Ser Ala Glu Thr Thr Glu Leu Ser Gly Lys Cys Val Val Ala Leu Ala 

90 225 230 235 240 

91 Thr Asp Pro Asn He Leu Ser Leu Ser Gly Lys Val Leu Pro Ser Cys 
92. 245 250 255 

93 Asp Leu Ala Arg Arg Tyr Gly Leu Arg Asp Val Asp Gly Arg Pro Val 

94 ~ 260 ^ 265 270 

95 Gin Asp Tyr Leu Ser Leu Ser Ser Val Leu Ser His Val Ser Gly Leu 

96 275 280 285 

97 Gly Trp Leu Ala Ser Tyr Leu Pro Ser Phe Leu Arg Val Pro Lys Trp 

98 290 295 300 

99 He He Ala Leu Tyr Thr Ser Lys Phe 

100 305 310 

102 (2) INFORMATION FOR SEQ ID NO : 2 : 

104 (i) SEQUENCE CHARACTERISTICS: 

105 (A) LENGTH: 1387 base pairs 

106 (B) TYPE: nucleic acid 

107 (C) STRANDEDNESS : single 

108 (D) TOPOLOGY: linear 

110 (vii) IMMEDIATE SOURCE: 

111 (A) LIBRARY: PROSNOT01 

112 (B) CLONE: 356351 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/006 , 163 



DATE: 02/01/2002 
TIME: 11:47:54 



Input Set : N:\Crf3\RULE60\10006163.raw 
Output Set: N:\CRF3\02012002\J006163.raw 

114 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

116 CTAACTTTGG CCTGGGACTC TGCCCCTCTA CCTCAGCACA GAATCGCCCC 

117 ACAGAATCAA TCCTTGAACA CTGCCTCCAC GTCGCCGGCT CAATCTGGGC 

118 ACTTCCACCG CAGCCCCGCA ATCTGCAGAC CTCAGCGGCA GCGCAGGTGG 

119 TCCTTTGCCT GTGAGTCATG GCAGCTCCCA TGAATGGCCA AGTGTGTGTG 

120 CCTCCAGGGG TATTGGCCGT GGCATTGCCT TGCAGCTCTG CAAAGCAGGC 

121 ACATCACTGG CCGCCATCTG GACACCCTTC GCGTTGTTGC TCAGGAGGCA 

122 GGGGCCAATG TGTGCCTGTG GTGTGCGATT CAAGCCAGGA GAGTGAAGTG 

123 TTGAGCAAGT GGATCGGGAA CAGCAAGGGC GTCTAGATGT GCTGGTCAAC 

124 CAGGGGTCCA GACGATCCTG AACACCAGGA ATAAGGCATT CTGGGAAACC 

125 TGTGGGATGA TATCAACAAC GTCGGACTCA GAGGCCACTA CTTTTGCTCA 

126 CACGGCTGAT GGTACCAGCT GGCCAGGGGC TCATCGTGGT CATCTCCTCC 

127 TGCAGTATAT GTTCAATGTC CCCTATGGTG TGGGCAAAGC TGCGTGTGAC 

128 CTGACTGTGC CCACGAGCTG CGGCGCCATG GGGTCAGCTG TGTGTCTCTG 

129 TTGTGCAGAC AGAACTGCTG AAGGAGCATA TGGCAAAGGA GGAGGTCCTG 

130 TGTTGAAGCA GTTCAAATCA GCCTTCTCAT CTGCAGAAAC CACAGAATTG 

131 GTGTGGTGGC TTTGGCAACA GATCCCAATA TCCTGAGCCT GAGTGGTAAG 

132 CCTGTGACCT TGCTCGACGC TATGGCCTTC GGGATGTGGA CGGCCGCCCC 

133 ATTTGTCTTT GAGCTCTGTT CTCTCACACG TGTCCGGCCT GGGCTGGCTG 

134 TGCCCTCCTT CCTCCGTGTG CCCAAGTGGA TTATTGCCCT CTACACTAGC 

135 CCTCCTGGTC TGACACTACG TCTCTGCTTG TCTTCTCATT TGGACTTGGT 

136 GTCTCAGTGA AACAGCAGCC TTTCTTGTTT ACCCATACCC TTGATATGAA 

137 CTGCTGTGTG TCCGTGGTGA GTTCTGGGGT GCGCCTAGGT CCCTTCTTTG 

138 TTCCTTGTCC TTCTTTTTAC TTTTTGCCTT AGTATTGAAA AATGCTCTTG 

139 AAGTCTA 

141 (2) INFORMATION FOR SEQ ID NO : 3: 

143 (i) SEQUENCE CHARACTERISTICS: 

144 (A) LENGTH: 323 amino acids 

145 (B) TYPE: amino acid 

146 (C) STRANDEDNESS : single 

147 (D) TOPOLOGY: linear 

149 (vii) IMMEDIATE SOURCE: 

150 (A) LIBRARY: GenBank 

151 (B) CLONE: 2315796 

153 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

155 Met Gly Val He Leu Gin Asp Gin Val Ala Leu Val Thr 

156 15 10 

157 Arg Gly He Gly Arg Gly He Ala Leu Gin Leu Gly Glu 

158 20 25 

159 Thr Val Tyr He Thr Gly Arg Arg Pro Glu Leu Ser Asp 

160 35 40 45 

161 Leu Gly Leu Pro Ser Leu Asp Tyr Val Ala Lys Glu He 

162 50 55 60 

163 Gly Gly Lys Gly He Ala Leu Tyr Val Asp His Ser Asn 

164 65 70 75 

165 Val Lys Phe Leu Phe Glu Lys lie Lys Glu Asp Glu Glu 

166 85 90 

167 Asp He Leu Val Asn Asn Val Tyr Asn Ser Leu Gly Lys 

168 100 105 



GGGTCCTACT 
GAGAACCCAG 
CAGACCTGCC 
GTGACTGGTG 
GCCACAGTTT 
CAATCCCTCG 
CGAACGCTGT 
AATGCTTATG 
CCTGCCTCCA 
GTGTATGGGG 
CCAGGAAGCC 
AAGCTGGCTG 
TGGCCGGGGA 
CAGGATCCTG 
AGTGGCAAAT 
GTGCTGCCAT 
GTCCAAGACT 
GCCTCCTACC 
AAGTTCTAAC 
GGTTCGTCCT 
GAGAAGCCCT 
TGCCTTGGTT 
GAGCTAATAA 
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VERIFICATION SUMMARY DATE: 02/01/2002 

PATENT APPLICATION: US/10/006 , 163 TIME: 11:47:55. 

Input Set : N:\Crf3\RULE60\10006163.raw 
Output Set: N:\CRF3\02012002\J006163.raw 

L:27 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER: ] 
L:28 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
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